An unconstrained statistical matching algorithm for combining individual and household level geo-specific census and survey data

Authors

  • Mohammad-Reza Namazi-Rad
  • Robert Tanton
  • David Steel
  • Payam Mokhtarian
  • Sumonkanti Das
Abstract

The Population Census is an important source of statistical information in most countries and is capable of producing reliable estimates of population characteristics for small geographic areas. One limitation of a census is that many population characteristics cannot be collected because of respondent burden or cost. Statistical agencies therefore have to conduct population-based surveys to provide social, economic and demographic characteristics of a target population that are not captured by a large-scale census. These surveys can usually produce direct estimates at the national level and for high-level regions, but often cannot produce reliable estimates for smaller areas. Because of the increasing demand for comprehensive statistical information not only at the national level but also for sub-national domains, there is wide discussion in the literature about statistical techniques that combine survey and census data to provide more detailed, finer-level estimates. Where censuses and sample surveys are based on the same reporting units, statistical matching techniques can be employed to link survey and census records when exact matching of reporting units is impossible because of confidentiality restrictions. These techniques can then provide the detailed social, economic and demographic information required for small areas. This paper develops an approach in which a close-to-reality synthetic population of individuals and households is generated from available census tables using an iterative proportional updating (IPU) method. Statistical matching using a nearest neighbour method is then used to impute survey data to the individuals and households in the synthetic population. To evaluate this approach, 2011 Bangladesh census data is used to generate a district-specific synthetic population of individuals and households. Matching is then performed by imputing the nearest possible records from the 2011 Bangladesh Demographic and Health Survey to estimate the wealth index for each household within the synthetic population. The results show that the method presented in this paper helps to achieve more representative estimates (compared with direct survey estimates), particularly for areas with small sample sizes where not all population units with different sociodemographic characteristics are included.
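The two steps named in the abstract, IPU reweighting and nearest neighbour imputation, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the incidence-matrix layout, the stopping rule and the use of squared Euclidean distance are all assumptions made for illustration, and the abstract does not say which distance or matching variables the paper uses.

```python
import numpy as np


def ipu_weights(incidence, targets, n_iter=500, tol=1e-8):
    """Hypothetical IPU sketch: reweight sample households to census totals.

    incidence[h, j] is the contribution of household h to constraint j:
    1 or 0 for a household-type constraint, or the number of persons of a
    given type in the household for a person-level constraint.
    targets[j] is the census total for constraint j (assumed positive).
    """
    incidence = np.asarray(incidence, dtype=float)
    targets = np.asarray(targets, dtype=float)
    weights = np.ones(incidence.shape[0])
    for _ in range(n_iter):
        # Adjust one constraint at a time, scaling only the households
        # that contribute to that constraint.
        for j in range(incidence.shape[1]):
            current = weights @ incidence[:, j]
            if current > 0:
                contributes = incidence[:, j] > 0
                weights[contributes] *= targets[j] / current
        # Stop once every weighted total is close to its target.
        fitted = weights @ incidence
        if np.max(np.abs(fitted - targets) / targets) < tol:
            break
    return weights


def nearest_neighbour_impute(recipients, donors, donor_values):
    """Hypothetical matching sketch: copy each recipient's value from the
    closest donor record, using squared Euclidean distance (an assumption).
    """
    recipients = np.asarray(recipients, dtype=float)
    donors = np.asarray(donors, dtype=float)
    # Pairwise squared distances, shape (n_recipients, n_donors).
    diff = recipients[:, None, :] - donors[None, :, :]
    dist = (diff ** 2).sum(axis=2)
    nearest = dist.argmin(axis=1)
    return np.asarray(donor_values)[nearest]
```

In this sketch, synthetic households would play the role of recipients and survey households the role of donors, with a variable such as the wealth index passed as `donor_values`. In practice the matching variables would normally be put on a comparable scale before computing distances, since unscaled variables with large ranges dominate the Euclidean distance.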


Similar resources

A Donor Imputation System to Create a Census Database Fully Adjusted for Underenumeration

Following the problems with estimating underenumeration in the 1991 Census of England and Wales, the aim for the 2001 Census is to create a database that is fully adjusted for net underenumeration. To achieve this, the paper investigates weighted donor imputation methodology that utilises information from both the census and the census coverage survey (CCS). The US Census Bureau has considered a sim...


Disability in Iran: Prevalence, Characteristics, and Its Economic and Social Correlates

Objective: The main purpose of this study was to investigate the prevalence and socio-economic correlates of disabilities in Iran. Materials & Methods: Secondary analysis of aggregate and micro data (at the individual and household level) from the Iranian 2006 census is used. The micro-data collection contains a representative sample including 19,848 disabled people and 1,235,445 non-disabled people i...


Area specific confidence intervals for a small area mean under the Fay-Herriot model

Small area estimates have received much attention from both private and public sectors due to the growing demand for effective planning of health services, apportioning of government funds and policy and decision making. Surveys are generally designed to give representative estimates at national or district level, but estimates of variables of interest are oft...


Optimizing image steganography by combining the GA and ICA

In this study, a novel approach is proposed that combines steganography and cryptography to hide information in digital images used as host media. In the process, secret data is first encrypted using the mono-alphabetic substitution cipher method and then the encrypted secret data is embedded inside an image using an algorithm which combines the random patterns based on Space Fillin...


Impact of Small-Holders’ Cattle Fattening on Household Income Generation in Fadis District of Eastern Hararghe Zone, Oromia, Ethiopia

At the household level, livestock plays a critical economic and social role in pastoralist and smallholder farm households. The objectives of this study were to analyze factors affecting participation in cattle fattening and its impacts on household income in Fadis district of Eastern Hararghe. Both...



Journal:
  • Computers, Environment and Urban Systems

Volume 63


Publication date: 2017